Transcribing against time
Authors
Abstract
We investigate the problem of manually correcting errors in an automatic speech transcript in a cost-sensitive fashion. This is done by specifying a fixed time budget and then automatically choosing the location and size of segments for correction such that the number of corrected errors is maximized. The core components, as suggested by previous research [1], are a utility model that estimates the number of errors in a particular segment, and a cost model that estimates the annotation effort for the segment. In this work we propose a dynamic updating framework that allows cost models to be trained during the ongoing transcription process. This removes the need for transcriber enrollment prior to the actual transcription, and improves correction efficiency by allowing highly transcriber-adaptive cost modeling. We first confirm and analyze the improvements afforded by this method in a simulated study. We then conduct a realistic user study, observing relative efficiency improvements of 15% on average, and of 42% for the participants who deviated most strongly from our initial, transcriber-agnostic cost model. Moreover, we find that our updating framework can capture dynamically changing factors, such as transcriber fatigue and topic familiarity, which we observe to have a large influence on the transcriber’s working behavior.
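The interplay of time budget, utility model, and cost model described in the abstract can be illustrated with a small sketch. This is not the authors' method: the greedy errors-per-second selection, the linear cost model (seconds = overhead + per-word rate × words), the least-squares refit, and every name used here (`Segment`, `LinearCostModel`, `select_segments`) are assumptions chosen only to make the idea concrete.

```python
from dataclasses import dataclass, field


@dataclass
class Segment:
    start: int              # index of the first word in the transcript
    num_words: int          # segment length in words
    expected_errors: float  # output of a utility model (assumed given)


@dataclass
class LinearCostModel:
    """Hypothetical cost model: seconds = overhead + per_word * num_words."""
    overhead: float = 2.0   # transcriber-agnostic starting values (assumed)
    per_word: float = 1.0
    observations: list = field(default_factory=list)

    def predict(self, seg: Segment) -> float:
        return self.overhead + self.per_word * seg.num_words

    def update(self, num_words: int, seconds: float) -> None:
        """Record one timed correction and refit by ordinary least squares."""
        self.observations.append((num_words, seconds))
        n = len(self.observations)
        if n < 2:
            return  # not enough data to fit two parameters
        mean_x = sum(x for x, _ in self.observations) / n
        mean_y = sum(y for _, y in self.observations) / n
        var = sum((x - mean_x) ** 2 for x, _ in self.observations)
        if var == 0:
            return  # all observed segments have the same length
        cov = sum((x - mean_x) * (y - mean_y) for x, y in self.observations)
        self.per_word = cov / var
        self.overhead = mean_y - self.per_word * mean_x


def select_segments(candidates, model, budget_seconds):
    """Greedily pick non-overlapping candidates by expected errors per second."""
    ranked = sorted(
        candidates,
        key=lambda s: s.expected_errors / model.predict(s),
        reverse=True,
    )
    chosen, remaining = [], budget_seconds
    for seg in ranked:
        cost = model.predict(seg)
        if cost <= remaining:
            chosen.append(seg)
            remaining -= cost
    return chosen
```

In such a loop, each completed correction would be timed and passed to `update`, and `select_segments` would be rerun on the remaining candidates with the remaining budget, so that the selection adapts to the individual transcriber as the session proceeds.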
Similar resources
The Effect of Transcribing on Beginning Learners’ Phonemic Perception
A large number of studies dealing with phonology have focused their attention on phonological production at the expense of phonological perception which provides the foundation stone for phonological production. This study focuses on phonological perception at phonemic level. The purpose of the study is helping beginning learners improve their perception of the English phonemes which are confus...
RNA polymerase locations in the simian virus 40 transcription complex.
Transcription complexes of simian virus 40 can be isolated from cells late in infection in a form that retains the ability to continue transcription in vitro. These complexes have been investigated previously to gain information about the nucleoprotein structure of a transcribing gene. However, several studies have reported that the RNA polymerase molecules in such complexes are located almost ...
Diversity of GABAergic interneurons in layer VIa and VIb of mouse barrel cortex.
Neocortical layer VI modulates the thalamocortical transfer of information and has a significant impact on sensory processing. This function implicates local γ-aminobutyric acidergic (GABAergic) interneurons that have only been partly described at the present time. Here, we characterized 85 layer VI GABAergic interneurons in acute slices of mouse somatosensory barrel cortex, using whole-cell cu...
Vesicular Stomatitis Virus Polymerase's Strong Affinity to Its Template Suggests Exotic Transcription Models
Vesicular stomatitis virus (VSV) is the prototype for negative sense non segmented (NNS) RNA viruses which include potent human and animal pathogens such as Rabies, Ebola and measles. The polymerases of NNS RNA viruses only initiate transcription at or near the 3' end of their genome template. We measured the dissociation constant of VSV polymerases from their whole genome template to be 20 pM....
The Voice Transcription Technique: Use of Voice Recognition Software to Transcribe Digital Interview Data in Qualitative Research
Transcribing interview data is a time-consuming task that most qualitative researchers dislike. Transcribing is even more difficult for people with physical limitations because traditional transcribing requires manual dexterity and the ability to sit at a computer for long stretches of time. Researchers have begun to explore using an automated transcription process using digital recordings and ...
[Changes in mental workload and fatigue during performance of a mental task. 1. An experiment in 8-h self-paced transcribing task].
The changes in mental workload and fatigue during a one-day transcribing task were examined by determining some subjective and physiological measures which reflect mental activity. With an interval of one week between the three test days, 12 male students rested and performed self-paced transcribing task with moderate and maximum effort for 8 h each. The subjects transcribed more characters in ...
Journal title:
- Speech Communication
Volume 93
Publication date: 2017